Repeats and Palindromes: an Overview

Authors

  • Sara H. Geizhals
  • Dina Sokol

Abstract

With a long text string like DNA, repeats and palindromes are not easily spotted. Yet finding such substrings is important; for instance, repeats in DNA are indicators of certain hereditary disorders and are used as genetic markers. We discuss repeats and then palindromes, and then we relate the two. In our discussion of repeats, we first define an exact repeat and then five definitions of approximate repeats. We mention algorithms that search a text string for substrings that satisfy these six definitions. In addition, we categorize the five approximate repeats in five different ways. When we look at palindromes, we look at Manacher's algorithm to find the longest exact palindrome in a string and also an algorithm that finds the longest approximate palindrome in compressed data.
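
The abstract names Manacher's algorithm without reproducing it here. As a rough illustration only, the following Python sketch shows one common way to write it; the function name `longest_palindrome` and the use of `None` as a separator are assumptions made for this example, not details taken from the paper. It treats a palindrome as a substring that reads the same in both directions, character by character.

```python
def longest_palindrome(s: str) -> str:
    """Return the leftmost longest palindromic substring of s (Manacher, O(n))."""
    if not s:
        return ""
    # Put a separator (None) between characters and at both ends, so that
    # even-length and odd-length palindromes are both centered on an index.
    t = [None]
    for ch in s:
        t.append(ch)
        t.append(None)
    n = len(t)
    radius = [0] * n      # radius[i]: matched positions on each side of i in t
    center = right = 0    # center and right edge of the rightmost palindrome seen
    for i in range(n):
        if i < right:
            # Reuse the radius of the mirror position, capped at the known edge.
            radius[i] = min(right - i, radius[2 * center - i])
        lo, hi = i - radius[i] - 1, i + radius[i] + 1
        while lo >= 0 and hi < n and t[lo] == t[hi]:
            radius[i] += 1
            lo -= 1
            hi += 1
        if i + radius[i] > right:
            center, right = i, i + radius[i]
    # In t, the radius equals the palindrome's length in s.
    best = max(range(n), key=radius.__getitem__)
    start = (best - radius[best]) // 2
    return s[start:start + radius[best]]


if __name__ == "__main__":
    print(longest_palindrome("banana"))
```

Each position's radius is seeded from its mirror image inside the rightmost palindrome found so far, which is what keeps the total work linear in the length of the string.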

Publication date: 2014